Supervised single-channel speech enhancement using ratio mask with joint dictionary learning
نویسندگان
چکیده
More comparisons and analyses about the paper "Supervised Single-Channel Speech Enhancement Using Ratio Mask with Joint Dictionary Learning" mentioned are given in this document. Specifically, 1) comparisons between the clean spectra/IRM and reconstructed spectra/IRM in the training stage are done; 2) the influence of the speech dominant parameter performed on the SNMF algorithms are shown; 3) a Wiener-type filter and the proposed SM filters applied to GDL algorithm are performed and discussed; 4) discussions about different ways using the estimated RMs to recover the speech from the mixture are presented; 5) some more results of another 4 different noise types for the speaker-dependent case are shown. The previous four experiments have been done under the speaker-dependent case. 6) The more results for the speaker-independent case are shown to compare the performances of different algorithms. At last, the spectrograms of used interferers are given at last. As mentioned in our paper that the 3 objective measures (segSNR, PESQ, fwsegSNR) are calculated to measure the objective intelligibility and quality of recovered speech from different perspectives, although there are some correlations between them (Ma, 2009; Hu, 2008). The objective measures may be improved to various extent by different algorithms. From this perspective, the improvements of segSNR, PESQ and fwsegSNR may not be consistent between RMJDL-SM1 and RMJDL-SM2, which means one algorithm may improve one measure more than another algorithm, but the situation may be reverse for another measure.
منابع مشابه
A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...
متن کاملSpeech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
متن کاملSingle-channel dynamic exemplar-based speech enhancement
This paper proposes an exemplar-based speech enhancement method based on high-resolution STFT magnitude spectrograms, where a selection of the nonnegative training data is used as the dictionary to provide a holistic nonnegative representation of the test data. We discuss how this exemplar-based model ensures that the enhanced speech signal falls on the speech manifold, which improves the quali...
متن کاملStudent-Teacher Learning for BLSTM Mask-based Speech Enhancement
Spectral mask estimation using bidirectional long short-term memory (BLSTM) neural networks has been widely used in various speech enhancement applications, and it has achieved great success when it is applied to multichannel enhancement techniques with a mask-based beamformer. However, when these masks are used for single channel speech enhancement they severely distort the speech signal and m...
متن کاملNon-negative matrix factorization with linear constraints for single-channel speech enhancement
This paper investigates a non-negative matrix factorization (NMF)-based approach to the semi-supervised single-channel speech enhancement problem where only non-stationary additive noise signals are given. The proposed method relies on sinusoidal model of speech production which is integrated inside NMF framework using linear constraints on dictionary atoms. This method is further developed to ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Speech Communication
دوره 82 شماره
صفحات -
تاریخ انتشار 2016